REMARKS 

This Amendment is in response to the Office Action mailed March 13, 2001 . Claims 1- 
18 are pending and claims 20-24 have been added. Claims 1 and 7 have been cancelled and 
claims 6, 8 and 12 have been amended. Examination of newly added claims 20-24 is 
respectfully requested. 

In the Office Action, claim 6 is objected to as being dependent upon a rejected base claim 
but would be allowable if placed in independent form. Thus, claim 6 has been amended to 
include limitations of prior claim 1. The amendment of claim 6 does not narrow the claim or 
relate to statutory requirements. Instead, the amendment is made to modify the structure of the 
claim as an independent claim. 

Similar, claim 12 has been amended to include limitations of base claim 7. The 
amendment of claim 12 does not narrow the claim or relate to statutory requirements. Instead, 
the amendment is made to modify the structure of the claim as an independent claim. 

Rejection Under 35 U.S.C. S 103 

The Examiner rejects claims 1, 7 and 13 as being unpatentable over Polcvn (U.S. Patent 
No. 5,31 1,588) in view of Nishiguchi (U.S. Patent No. 5,664,052). It is alleged that Polcvn 
suggests/teaches "determining a peak-to-mean ratio. " Applicant respectfully disagrees with 
grounds for rejection and submits that a prima facie case of obviousness has not been satisfied. 
In particular, Polcvn describes peak to average ratios (PAR), which substantially differs from the 
peak-to-mean likelihood ratio (PMLR) as described in page 16, line 1 1 et seq. of the 
specification. 

(APMR max -APMR mn ) 
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The contents of the numerator and denominator forming the PMLR are now claimed in 
newly formed independent claims 6 and 12. Claim 13 has not been amended because such 
amendment is unnecessary in light of the clear lack of teaching or suggestion of using the PMLR 
value to determine whether an audio frame being transmitted represents a voice signal. 

In light of the foregoing, Applicant respectfully requests the Examiner to withdrawal the 
above-cited § 103 rejection and allow claims 6, 12 and 13 and those claims dependent therefrom. 

The Examiner further rejects claims 2-4, 8-12 and 14-18 as being unpatentable over 
Polcvn and Nishiguchi in view of Graumann (U.S. Patent No. 5,737,407). Applicant contends 
that claims 2-4, 8-12 and 14-17 are in condition for allowance merely based on the dependency 
of independent claims 6 and 12. In addition, claim 18 is allowable because neither Polcvn, 
Nishiguchi nor Graumann , alone or in combination, suggest the implementation each claimed 
circuitry listed below. 

(1) circuitry to determine whether a difference between the long-term 
averaged energy and the short-term averaged energy is less than a predetermined 
threshold when the short-term averaged energy is greater than the long-term 
averaged energy by the predetermined factor; 

(2) circuitry to determine a normalized peak-to-mean likelihood ratio 
when the difference between the long-term averaged energy and the short-term 
averaged energy is less than the predetermined threshold; or 

(3) circuitry to comparing the peak-to-mean likelihood ratio to a selected 
threshold and to determine that the audio frame represents a voice signal when the 
peak-to-mean likelihood ratio is greater than a selected threshold. 
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In light of the foregoing, Applicant respectfully requests the Examiner to withdrawal the 
above-cited § 103 rejection and allow claims 2-4, 8-12 and 14-18. Applicant respectfully 
requests the specific cite (column/line number) where such alleged teachings of the above- 
mentioned circuitry may be found in the cited patents if the Examiner still disagrees with 
Applicant after careful reconsideration. 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 



1 1. Cancelled. 

1 2. (Amended) The method of claim 6 [1], wherein prior to determining the 

2 peak-to-mean likelihood ratio, the method further comprises: 

3 determining a short-term averaged energy for the current audio frame; and 

4 determining a long-term averaged energy for the current audio frame. 

1 3. The method of claim 2, wherein after determining the short-term averaged 

2 energy and the long-term averaged energy, the method further comprises: 

3 determining whether a sum of the short-term averaged energy and a factor is greater 

4 than the long-term averaged energy; and 

5 determining that the current audio frame represents silence if the sum is less than the 



6 long-term averaged energy, without necessitating a determination of the peak-to-mean 

7 likelihood ratio. 

1 4. The method of claim 3, upon determining that the sum is greater than the 

2 long-term averaged energy and before determining the peak-to-mean likelihood ratio, the 

3 method further comprises: 

4 determining whether a difference between the long-term averaged energy and the 

5 short-term averaged energy is less than a predetermined threshold; 

6 determining that the current audio frame represents voice if the difference is greater 

7 than the predetermined threshold; and 
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8 continuing by determining the peak-to-mean likelihood ratio if the difference is less 

9 than the predetermined threshold. 

1 5. The method of claim 2, wherein the determining of the short-term averaged 

2 energy comprises: 

3 determining an energy, in decibels, of the current audio frame; 

4 determining a short-term averaged energy for a prior audio frame; and 

5 conducting a weighted average of the energy of the current audio frame and the short- 

6 term averaged energy for the prior audio frame. 

1 6. (Twice Amended) A [The] method for enhancing voice activity detection 

2 [of claim 1, wherein] comprising : 

3 determining a peak-to-mean likelihood ratio, the determining a peak-to-mean 

4 likelihood ratio comprises 

5 calculating an averaged peak-to-mean ratio for the current audio frame a [;] 

6 determining a maximum averaged peak-to-mean ratio A [;] 

7 determining a minimum averaged peak-to-mean ratio^;] 

8 determining a difference between the maximum averaged peak-to-mean ratio and the 

9 averaged peak-to-mean ratio for the current audio framed;] 

10 determining a difference between the maximum averaged peak-to-mean ratio and the 

1 1 minimum averaged peak-to-mean ratio A [;] and 

12 conducting a ratio, a denominator of the ratio being the difference between the 

13 maximum averaged peak-to-mean ratio and the minimum averaged peak-to-mean ratio, the 



003239.P010 

App. No. 09/134,272 



-14- 



WWS/crr 
Filed: 08/14/98 




14 numerator being the difference between the maximum averaged peak-to-mean ratio and the 

1 5 averaged peak-to-mean ratio ; and 

16 comparing the peak-to-mean likelihood ratio to a selected threshold to determine 

17 whether a current audio frame represents a voice signal . 

1 7. Cancelled. 

1 8. (Amended) The communication module of claim 12 [71, wherein the voice 



2 activity detector, when executed, controls the processing unit to determine whether a sum of 

3 the short-term averaged energy and a predetermined factor is greater than the long-term 

4 averaged energy, and to signal that the current audio frame represents silence if the sum is 



5 less than the long-term averaged energy. 

1 9. The communication module of claim 8, wherein the voice activity detector, 

2 when executed, controls the processing unit to determine whether a difference between the 

3 long-term averaged energy and the short-term averaged energy is less than a predetermined 



4 threshold, and to signal that the current audio frame represents voice if the difference is 

5 greater than the predetermined threshold. 



1 10. The communication module of claim 9, wherein the voice activity detector, 

2 when executed, controls the processing unit to determine the peak-to-mean likelihood ratio, 

3 and to compare the peak-to-mean likelihood ratio to a selected threshold to determine 

4 whether a current audio frame represents a voice signal. 
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1 11. The communication module of claim 1 0, wherein the voice activity detector, 

2 when executed, controls the processing unit to determine a peak-to-mean ratio by (i) 

3 sampling an analog signal a predetermined number of times to produce a plurality of sampled 

4 signals each having a sampled value, (ii) determining a maximum value of the plurality of 

5 sampled signals, and (iii) conducting a ratio between an absolute value of the maximum 

6 value and a summation of the sampled values for the plurality of sampled signals. 

1 12. (Amended) A [The] communication module [of claim 10, wherein] 

2 a substrate; 

3 a processing unit placed on the substrate; and 

4 a memory coupled to the processing unit, the memory to contain a voice activity 

5 detector which, when executed, controls the processing unit to determine an averaged peak- 



6 to-mean ratio for the current audio frame by (i) monitoring a maximum averaged peak-to- 

7 mean ratio and a minimum averaged peak-to-mean ratio, (ii) determining a first result being a 

8 difference between the maximum averaged peak-to-mean ratio and the averaged peak-to- 

9 mean ratio for the current audio frame, (iii) determining a second result being a difference 

10 between the maximum averaged peak-to-mean ratio and the minimum averaged peak-to- 

1 1 mean ratio, and (iv) conducting a ratio between the first result and the second result to 

12 produce the peak-to-mean likelihood ratio. 

1 13. (Amended) A machine readable medium having embodied thereon a 

2 computer program for processing by a machine, the computer program comprising: 

3 a first routine for determining a normalized peak-to-mean likelihood ratio; and 
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4 a second routine for comparing the peak-to-mean likelihood ratio to a selected 

5 threshold to determine whether an audio frame being transmitted represents a voice signal. 

1 14. The machine readable medium of claim 13, wherein the computer program 

2 further comprising: 

3 a third routine for determining a short-term averaged energy for the audio frame, the 

4 third routine being executed before the first and second routines; and 

5 a fourth routine for determining a long-term averaged energy for the audio frame, the 

6 fourth routine being executed before the first and second routines. 

1 15. The machine readable medium of claim 14, wherein the computer program 

2 further comprising: 

3 a fifth routine for determining whether a sum of the short-term averaged energy and a 

4 predetermined factor is greater than the long-term averaged energy, the fifth routine being 

5 executed before the first and second routines; and 

6 a sixth routine for determining whether a difference between the long-term averaged 



7 energy and the short-term averaged energy is less than a predetermined threshold, the sixth 

8 routine being executed after determining that the sum is greater than the long-term averaged 

9 energy and before execution of the first and second routines. 

1 16. The machine readable medium of claim 15, wherein the fifth routine 

2 determining that the current audio frame represents silence if the sum is less than the long- 

3 term averaged energy. 
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1 17. The machine readable medium of claim 1 5, wherein the sixth routine 

2 determining that the current audio frame represents voice if the difference is greater than the 

3 predetermined threshold. 

1 18. (Amended) A voice activity detector comprising: 

2 circuitry to determine a short-term averaged energy for an audio frame; 

3 circuitry to determine a long-term averaged energy for the audio frame; 

4 circuitry to determine whether the short-term averaged energy is greater than the 

5 long-term averaged energy by a predetermined factor; 

6 circuitry to determine whether a difference between the long-term averaged energy 



7 and the short-term averaged energy is less than a predetermined threshold when the short- 

8 term averaged energy is greater than the long-term averaged energy by the predetermined 

9 factor; 



10 circuitry to determine a normalized peak-to-mean likelihood ratio when the difference 

1 1 between the long-term averaged energy and the short-term averaged energy is less than the 

12 predetermined threshold; and 

13 circuitry to comparing the peak-to-mean likelihood ratio to a selected threshold and to 

14 determine that the audio frame represents a voice signal when the peak-to-mean likelihood 

15 ratio is greater than a selected threshold. 

1 20. (New) A method for enhancing voice activity detection comprising: 

2 determining a peak-to-mean likelihood ratio including (i) a denominator having a 

3 value substantially equal to a difference between a maximum averaged peak-to-mean ratio 
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4 and a minimum averaged peak-to-mean ratio and (ii) a numerator having a value 

5 substantially equal to a difference between the maximum averaged peak-to-mean ratio and 

6 the averaged peak-to-mean ratio; and 

7 comparing the peak-to-mean likelihood ratio to a selected threshold to determine 

8 whether a current audio frame represents a voice signal. 

1 21. (New) The method of claim 20, wherein prior to determining the peak-to- 

2 mean likelihood ratio, the method further comprises: 

3 determining a short-term averaged energy for the current audio frame; and 

4 determining a long-term averaged energy for the current audio frame. 

1 22. (New) The method of claim 2 1 , wherein after determining the short-term 

2 averaged energy and the long-term averaged energy, the method further comprises: 

3 determining whether a sum of the short-term averaged energy and a factor is greater 

4 than the long-term averaged energy; and 

5 determining that the current audio frame represents silence if the sum is less than the 

6 long-term averaged energy, without necessitating a determination of the peak-to-mean 

7 likelihood ratio. 

1 23. (New) The method of claim 22, upon determining that the sum is greater than 

2 the long-term averaged energy and before determining the peak-to-mean likelihood ratio, the 

3 method further comprises: 

4 determining whether a difference between the long-term averaged energy and the 

5 short-term averaged energy is less than a predetermined threshold; 
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6 - determining that the current audio frame represents voice if the difference is greater 

7 than the predetermined threshold; and 

8 continuing by determining the peak-to-mean likelihood ratio if the difference is less 

9 than the predetermined threshold. 

1 24. (New) The method of claim 2 1 , wherein the determining of the short-term 

2 averaged energy comprises: 

3 determining an energy, in decibels, of the current audio frame; 

4 determining a short-term averaged energy for a prior audio frame; and 

5 conducting a weighted average of the energy of the current audio frame and the short- 

6 term averaged energy for the prior audio frame. 
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CONCLUSION 

In view of the amendments and remarks made above, it is respectfully submitted that all 
pending claims are in condition for allowance, and such action is respectfully solicited. It is 
respectfully requested that the Examiner contact the undersigned attorney in order to facilitate 
prosecution of the subject application. 



Dated: June 13, 2001 



Respectfully submitted, 

Blakely, SokoloffJTaylor & Zafman LLP 
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